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Summary: 

A  spaced  advertising  schedule  is  much  more  dollar-effective  than  is  a  massed 
(pulsed)  schedule,  according  to  this  re-analysis  of  the  experimental  data 
developed  by  Pomerance  and  Zielske.  This  finding  is  at  variance  with  the  con- 
clusions drawn  by  previous  writers.  The  raw  Pomerance-Zielske  data  are 
published  here  for  the  first  time;  until  now  only  idealized  representations 
of  the  data  have  been  available,  which  has  been  the  source  of  confusion  and 
misunderstanding . 
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INTRODUCTION 
The  subject  of  the  research  reported  in  Zielske's  well-known  article 
(1959)  is  the  comparison  of  massed  advertising  ("flighting"  or  "pulsing") 
versus  spaced  advertising  schedules.  This  is  made  clear  by  the  introduc- 
tory statement  in  the  less-well-known  earlier  report  on  the  same  work  by 
Pomerance  and  Zielske  (P-Z  hereafter) : 

Decisions  have  to  be  made  continuously  on  how  adver- 
tising should  be  scheduled.  For  instance,  should  it 
be  concentrated  in  an  intensive  burst,  or  should  it  be 
spread  over  a  longer  period?  Should  a  relatively  small 
group  of  people  be  exposed  many  times,  or  a  larger 
group  a  smaller  number  of  times? 

In  other  words,  what  is  the  rate  at  which  consumers 
can  be  made  to  remember  advertising,  and  the  rate  at 
which  they  forget  it?   (Pomerance  and  Zielske,  1958) 

The  conclusion  is  hedged  in  both  the  P-Z  and  Zielske  pieces.   They  say 
that  a  pulsed  schedule  will  produce  a  higher  peak  number  of  rememberers 
whereas  a  spaced  schedule  will  produce  a  higher  average  number  of  remem- 
berers . 

The  ultimate  business  goal,  however,  is  profit  rather  than  peaks  of 
remembrance.   Or  with  respect  to  a  given  expenditure  for  advertising, 
the  goal  is  to  maximize  the  dollars  of  sales.  Other  measures  of  adver- 
tising are  simply  proxies  for  sales,  and  ultimately  they  must  be  related 
to  sales.   The  issue,  then,  is  whether  massed  or  spaced  advertising  pro- 
duces more  sales  per  dollar  of  advertising. 

If  we  consider  recall  of  advertisements  as  a  proxy  for  sales — 
always  a  questionable  assumption,  but  an  assumption  that  underlies  the 
bulk  of  advertising  research — then  comparison  of  the  average  recalls 
during  the  entire  relevant  period  indicates  the  relative  selling  power  of 
two  equal-cost  campaigns.   By  Zielske's  calculation,  monthly  spaced  ad- 
vertising produces  29  percent  average  recall  over  the  year  compared  to 
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21  percent  for  the  thirteen  weekly  advertisements  (p.  241),  apparently 
an  advantage  for  spaced  advertising.   But  Strong's  simulation,  based  on 
the  P-Z  data  as  well  as  additional  data  of  his  own,  yields  estimates  of 
30.1  percent  and  26.5  percent  respectively,  a  somewhat  smaller  advantage 
for  spaced  advertising.  And,  Strong — using  data  he  collected  from  a 
bi-weekly  schedule,  and  the  P-Z  data  plus  some  of  his  own  monthly-exposure 
data — concluded  from  a  simulation  that  a  schedule  of  four  bi-weekly  flights 
of  4,  3,  3,  and  3  exposures  spaced  throughout  the  year  "delivers  an  aver- 
age annual  recall  of  32%,  two  percentage  points  greater  than  that  delivered 
by  the  monthly  schedule"  (1974,  p.  377),  i.e.^  an  advantage  for  pulsing. 
Rao  (1970) ,  and  Ackhof f  and  Emshoff  (1975a;  1975b)  have  also  asserted, 
and  the  proposition  apparently  is  generally  accepted,  that  in  some  circum- 
stances pulsing  can  produce  more  sales  dollars  per  dollar  of-  advertising 
than  spaced  advertising.  r 

My  re-analysis  in  this  paper  of  the  P-Z  data  suggests  that  pulsing 
is  never  worthwhile — at  least  for  the  conditions  of  the  P-Z  experiment.' 
Furthermore,  this  re-analysis  shows  a  massive  advantage  for  spaced  ad- 
vertising— selling  power  twice  as  great — which  is  a  much  greater  advantage 
than  suggested' by  either  the  Zielske  or  the  Strong  methods  of  calculation 
of  average  recall. 

In  addition  to  the  finding  that  spaced  advertising  is  much  more 
dollar-effective  than  is  massed  advertising,  this  article  presents  for 
the  first  time  the  raw  P-Z  data,  wherein  lies  an  instructive  tale  of 
scientific  history  which  is  told  in  Appendix  B.   The  very  important  and 
imaginative  study  that  produced  the  data  mentioned  in  the  title  was 
directed  by  Eugene  Pomerance  and  Hugh  (sic)  Zielske  (Pomerance  and 
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include  a  re-analysis  of  the  raw  P-Z  data.  And  I  had  just  become  aware  of 
Stewart's  massive  study  (1964)  that  used  the  P-Z  design  but  that  repro- 
duced Zielske's  diagrams  as  if  they  represented  actual  data  points. 
I  therefore  wrote  to  Zielske  as  follows  (July  13,  1965): 

I  would  like  to  write  a  note  for  the  Journal  of 
Marketing  commenting  on  the  shortcomings  of  your 
presentation  of  the  data.   I  could  best  do  so  if 
your  actual  data  were  shown.  But  of  course  the 
data  are  yours  and  I  assume  were  sent  to  me  in 
confidence. 

So  two  questions:   1)  Would  you  like  to  co-author 
the  note  with  me  and  include  the  data?  or,  2)  May 
I  hav 
Zielske  replied  that  he  "would  have  no  interest  in  co-authoring  a 

'note'"  and  he  refused  me  permission  to  quote  his  data,  on  the  grounds 

that  I  misquoted  him  in  my  1965  article.  He  added 

I  am  fully  aware  that  all  people  who  have  quoted 
[my]  article  have  not  also  always  quoted  the  qual- 
ifications or  the  information  that  was  included 
on  methodology.  But  whether  or  not  more  technical 
detail  would  have  reduced  this  is  very  doubtful. 

A  full  technical  description  of  how  the  data  were 
processed,  together  with  the  raw  data,  has  been 
available  and  distributed  to  all  who  requested 
such  information. 

At  the  time  the  article  was  submitted  for  publi- 
cation, back  in  1958,  the  Journal  felt  (correctly, 
I  believe)  that  there  would  be  insufficient  inter- 
est in  this  "technical  appendix"  to  warrant  the 
space  it  would  require,  and  it's  probably  safe  to 
assume  that  interest  would  be  even  less  now — seven 
years  later." 

Whether  rightly  or  wrongly  I  do  not  know,  but  I  accepted  Zielske's 

refusal  and  did  nothing  in  print  about  this  matter.   Recently,  however, 

I  (with  Arndt)  began  to  prepare  a  new  article  on  the  advertising  response 

function  and  economies  of  scale,  and  again  my  thoughts  turned  to  the  P-Z 
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data.   I  asked — and  this  time  received-- permission  from  Eugene  Pomerance, 
for  Foote,  Cone  and  Belding,  Inc.,  to  reproduce  the  raw  data.  Afterwards 
our  literature  search  revealed  that  Strong  (1974;  see  also  Strong,  1972) 
replicated  the  P-Z  study,  and  included  a  graphic  presentation  of  the 
original  P-Z  data  in  his  report.*  As  noted  earlier,  Strong's  data  confirm 
the  P-Z  data  where  they  overlap;  this  is  comforting,  and  strengthens  the 
substantive  analysis  presented  in  the  article. 

That  the  P-Z  data  have  not  been  in  print  is  more  than  regrettable. 
Considerable  misunderstanding  about  the  substantive  issue  of  spaced  versus 
massed  advertising  has  been  caused.   Furthermore,  as  Strong  put  it  "The 
accretion  of  wisdom  on  top  of  Zielske's  admittedly  brilliant  work  has 
been  severely  hampered  by  the  ill-advised  smoothing  he  performed"  (letter 
of  March  8,  1978). 


*Another  element  in  what  would  be  a  comedy  if  it  were  funny  is  that 
when  I  wrote  Dr.  Edward  Strong  (sic)  at  Stanford  asking  to  see  a  copy  of 
his  thesis,  I  received  a  reply  that  Dr.  Strong  had  died  in  1963.  This  made 
me  wonder  whether  my  belief  that  there  was  a  contemporary  Edward  Strong 
was  in  error,  and  the  confusion  was  compounded  in  a  MSI  working  paper  by 
Ray  (1975)  that  lists  the  1914  publication  of  Edward  K.  Strong  under  the 
same  author  heading  as  the  1972  thesis  of  Edward  C.  Strong. 
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Zielske,  1958) ,  though  the  attribution  is  invariably  to  Zielske  as  pub- 
lished in  Zielske  (1959)  .   In  neither  the  original  articles  are  the  raw 
data  shown,  but  rather  smoothed  idealized  graphs  of  them  are  given. 
Since  then  many  studies  have  referred  to  the  Zielske  article,  and  have 
treated  the  smoothed  figures  as  if  they  are  actual  data  points.   It  is 
only  the  exception  (e.g.,  Lodish,  1971;  Corkindale  and  Kennedy,  1975, 
p.  80)  which  has  mentioned  that  the  figures  are  idealized  and  do  not 
represent  actual  observations.  Lodish  also  noted  that  he  was  unable  to 
obtain  the  raw  P-Z  data  for  re-analysis,  and  therefore  used  the  idealized 
representations  for  his  empirical  work.   In  1963  I  obtained  the  raw  data 
from  an  internal  publication  of  Foote,  Gone,  and  Belding,  but  I  was  not 
able  to  obtain  permission  to  publish  them  until  now. 

THE  METHOD  AND  DATA 

The  main  details  of  the  P-Z  method  (for  more  general  background  see 

Zielske,  1959)  are  as  stated  in  the  internal  research  report  of  Foote, 

Cone  and  Belding  (1958) : 

A  total  of  5668  names  was  selected  from  the  1955  Chicago 
telephone  directory.  This  total  was  subdivided  into  26 
test  samples  of  143  respondents  each  and  26  control 
samples  of  75  respondents  each. . . . 

The  following  steps  show  in  detail  how  the  individual 
samples  were  selected: 

1.  5668  names  were  systematically  selected  from  the 
1955  Chicago  telephone  directory  by  choosing  ap- 
proximately every  160th  residential  listing.  The 
names,  together  with  phone  numbers  and  addresses, 
were  typed  on  the  questionnaires.  All  male  names 
were  preceded  with  "Mrs." 

2.  The  questionnaires,  which  were  kept  in  alphabetical 
sequence  throughout  the  selection  process,  were  then 
divided  into  26  groups  of  218  each.  The  first 
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questionnaire  was  placed  in  group  1,  the  second  in  group 
2,  etc.,  until  all  26  groups  were  complete.   Groups 
1  through  12  were  designated  as  the  weekly  exposure 
groups — test  respondents  in  these  groups  received  ads 
at  weekly  intervals .  Groups  13  through  26  were 
designated  as  the  four-week-interval  exposure 
groups — test  respondents  in  these  groups  received 
ads  at  intervals  of  four  weeks. 

3.  From  each  of  the  26  groups  of  218  questionnaires, 
approximately  every  third  one  was  selected  as 
control  respondents  who  received  no  ads  but  were 
interviewed  at  the  same  time  as  the  test  respon- 
dents.  The  purpose  of  the  control  samples  was  to 
see  what  the  level  of  recall  was  among  those  not 
exposed  to  the  ads.   (There  was  no  significant 
trend  in  remembrance  of  normally  scheduled  adver- 
tising for  the  product  among  the  control  groups.) 
There  were  75  respondents  in  each  control  sample 
and  143  in  each  test  sample. 

These  samples  yielded  an  average  of  91  completed  inter- 
views for  each  of  the  26  test  exposure  groups  and  51 
completed  interviews  for  each  control  group.  The  over- 
all completion  rate  was  65%.... 

Thirteen  different  ads  were  used  in  this  study.   In 
order  to  minimize  any  effect  ad  sequence  might  have  on 
the  results,  the  order  in  which  the  ads  were  mailed  to 
each  group  of  respondents  was  varied. 

To  help  insure  respondents'  opening  the  letters,  six 
different  colored  envelopes  with  personal  return 
addresses  were  used. 

All  interviews  were  made  by  telephone,  and  a  maximum 
of  five  calls  were  made  in  an  attempt  to  contact  each 
respondent. 

The  exposure  schedules,  interview  schedules,  and  results  for  the 

weekly  and  every-four-weeks  exposure  groups  were  as  shown  in  Table  1. 

The  results  are  the  recall  percentages  shown  in  the  tables.   These  data 

are  the  points  plotted  in  Figures  1  and  2  and  Table  1. 


Figures  1  and  2;  Table  1 
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TABLE  1 

Exposure  schedule  and  the  results  for  the  groups  that  received  advertise- 
ments weeklv. 


Number  of  Weeks  Number  of  Weeks 

Number  of  Ads     Between  Mailing  Of  Between  Mailing  of 

Received  At     First  Ad  to  Group  and  Last  Ad  to  Group  and 

Group  Weekly  Intervals   Start  of  Interviewing  Start  of  Interviewing 


Recall 

Percentage 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 


1 

3 

5 

7 

9 

11 

13 

13 

13 

13 

13 

13 


1 

3 

5 

7 

9 

11 

13 

17 

25 

33 

41 

49 


1 

1 

1 

1 

1 

1 

1 

5 

13 

21 

29 

37 


13 

42 

67* 

49 

54 

58 

62 

30 

16 

8 

2 
? 


Exposure  schedule  and  results  for  the  groups  that  received  advertisements 
every  four  weeks. 


Number  of 

Weeks  Between 

Mailing  of 

Number 

of 

We 

eks 

Numb 

er  of 

Ads 

First  Ad 

Between  Mail 

ing 

Ree 

eived 

At 

to  Group 

of  Last 

Ad 

to 

Four-Week 

and  Start  of 

Group 

and 

Start 

Percentage 

Decrease 

Recall 

Group 

Interval 

Ls 

Interviewing 

of  Int 

erview 

ing 

in 

Reca 

11 

Percentage 

13 

1 

1   . 

1 

17 

14 

1 

5 

5 

100 

0 

15 

3 

9 

1 

36 

16 

3 

13 

5 

72 

10 

17 

5 

17 

1 

33 

18 

5 

21 

5 

21* 

26 

19 

7 

25 

1 

42 

20 

7 

29 

5 

49 

21 

21 

9 

33 

1 

49 

22 

9 

37 

5 

40 

29 

23 

11 

41 

1 

39 

24 

11 

45 

5 

33 

26 

25 

13 

49 

1 

43 

26 

13 

53 

5 

27 

31 

Measurement  that  was  discarded  ■'by  Zielske 
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I  treated  the  data  in  the  following  fashion: 

1.  The  raw  recall  percentages  for  observations  a  week  after  the 
last  advertising  exposure,  for  the  weekly  and  the  every- four-weeks  ex- 
posure schedules,  were  plotted  as  in  Figures  1  and  2,  taken  from  Table  1. 
For  the  four-week-exposure  group,  I  also  plotted  the  raw  recall  percentages 
for  observations  five  weeks  after  the  last  advertising  exposure  (computed 
from  the  raw  "decrease  in  recall"  data  in  Figure  3. 


Figure  3 
2.  The  data  for  decrease  in  recall  from  one  week  after  the  last  ex- 
posure to  five  weeks  after  the  last  exposure,  for  the  every-four-weeks 
exposure  groups,  are  plotted  in  Figure  4,  and  are  smoothed  by  eye.  This 
curve  could  be  fitted  analytically,  but  the  choice  of  function  could  not 
conceivably  affect  the  conclusions  drawn  below.  Nor  is  there  compelling 
scientific  logic  in  favor  of  analytical  functions  in  preference  to  eye- 
and-hand  functions.  Furthermore,  it  would  not  be  feasible  to  fit  the 
other  curves  in  the  paper  analytically.  But  again,  the  conclusions  are 
not  sensitive  to  the  exactness  of  the  drawing,  so  there  is  no  real  prob- 
lem. 


Figure  4 
3a.  Smoothed  curves  for  the  proportions  of  the  weekly  exposure 
groups  recalling  the  campaign  were  drawn  by  eye  in  Figure  5.  The  week- 
to-week  forgetting  during  the  13-week  exposure  period  was  ignored  because 
of  its  small  magnitude. 


Figure  5 


3b.  Smoothed  curves  were  drawn  for  the  proportions  of  the  every-four- 
weeks  groups  remembering  the  campaign.  The  procedure  was  as  follows: 
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(i)  Draw  a  smooth  curve  (the  rising  portion  of  the  solid  line  in  Figure  b) 
through  the  observations  in  Figure  2.   (ii)  Draw  in  the  estimated  recall 
points  four  weeks  after  each  observation,  e.g.,  3%  [ (1-.85) (.20) ]  at  week 
5,  and  10%  [ (1-.66) (.30)]  at  week  13,  using  the  four-week  decrease-in-recall 
function  in  Figure  4.   (iii)  Draw  in  the  recall  percentages  estimated  from 
the  smoothed  curves  for  unobserved  numbers  of  exposures,  using  the  smoothed 
envelope  curve,  e.g.,  26%  at  5  weeks,  and  34%  at  13  weeks,   (iv)  Draw  in 
the  recall  percentages  for  those  number  of  exposures  four  weeks  later,  e.>... 
6%  at  9  weeks.   (v)  Connect  all  the  points  in  Figure  b. 


Figure  6 
4.  Calculate  the  area  under  the  recall  curves  in  Figures  5  and  b  by 
integration  (counting  the  squares  on  the  graph  paper  is  a  simple  and 
effective  device) .  This  yields  the  total  remembrances  for  the  two  types 
of  campaigns. 

RESULTS 

1.  The  weekly  schedule  reaches  a  higher  peak  recall  percentage  than 
does  the  every-four-weeks  schedule,  as  stated  by  P-Z  (1958) .   But  this 

is  no  surprise,  nor  of  great  importance. 

2.  The  rate  of  forgetting  the  entire  campaign  after  Its  completion 
is  slower  after  13  exposures  spread  over  a  year  than  after  13  exposures 
spread  over  13  weeks  as  noted  by  P-Z;  the  decrease  in  recall  from  one  to 
five  weeks  after  the  last  of  the  thirteen  ads  is  70%  for  the  weekly 
schedule,  but  only  29%  for  the  four-weekly  schedules.   Xhla  BMXM  tliat 
though  recall  is  higher  for  the  weekly  schedule  immediately  after  the 
13th  exposure  than  for  the  every-four-weeks  groups,  after  about  17  weeks 
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the  rate  of  recall  becomes  higher  and  stays  higher  for  the  four-week 
group  thereafter.   (Compare  Figure  6  to  Figure  5.)   But  this  is  of  more 
interest  to  the  psychologist  than  to  the  marketing  decision-maker. 

3.  The  relevant  measure  of  an  advertising  campaign's  impact  is 
the  number  of  "recall  weeks,"  that  is,  the  number  of  weeks  multiplied  by 
the  appropriate  recall  rate.   This  measure  reveals  the  most  important 
outcome  of  this  study:   the  total  amount  of  advertising  impact  (as  measured 
by  recall-weeks)  is  much  higher  for  the  every-four -weeks  schedule  than 
for  the  weekly  schedule.   For  the  every-four-weeks  schedule  there  are 
roughly  2130  recall-weeks,  compared  with  between  1054  and  1076  recall- 
weeks  for  the  weekly  campaign. 

The  implication  is  clear:  A  given  advertising  budget  is  most  effi- 
cient if  it  is  spread  out  over  the  maximum  period  rather  than  concen- 
trated in  a  single  burst.   Several  bursts  will  be  better  than  one  burst 
but  worse  than  an  even  spread,  by  implication.  This  stark  conclusion  was 
not  drawn  by  Pomerance  and  Zielske  though  it  is  implied  in  their  finding 
about  the  higher  average  recall  with  the  monthly  schedule;  and  this  con- 
clusion requires  the  data  and  analysis  shown  here  to  substantiate  it. 
It  contradicts  the  assertion  by  Rao  (1970,  p.  55)  but  Rao  presented  no 
relevant  data.   It  does  not  agree  with  Strong's  result  for  a  simulated 
comparison  of  bi-weekly  flights  spaced  throughout  the  year  versus  monthly 
exposures,  but  tnis  result  of  Strong's  is  several  logical  and  assump- 
tional  steps  removed  from  actually-observed  data.  And  it  should  be 
remembered  that  where  they  overlap,  Strong's  raw  data  coincide  nicely 
with  the  P-Z  raw  data. 
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DISCUSSION 

The  conclusions  that  may  be  drawn  from  Zielske's  (and  Strong's)  data 
are  only  as  general  as  the  data  themselves.   The  range  of  the  data  is 
limited  as  follows:   (1)  the  advertised  product  was  a  food,  "a  staple  in 
almost  every  home"  (p.  1  of  Foote,  Cone  and  Belding  research  report). 
The  results  might  be  different  for  a  durable  or  another  sort  of  product 
that  differs  in  interpurchase  interval  and  in  other  ways.   (2)  The  data 
were  generated  by  just  one  advertising  medium,  direct  mail,  and  direct 
mail  would  not  be  a  likely  medium  for  a  staple  food  product.   (3)  The 
measure  of  advertising  impact  was  unaided  recall.  A  sales  measure  might 
show  a  different  sort  of  response  pattern.  And  a  recall  measure  has  a 
built-in  saturation  limit  (100%)  that  is  not  relevant  for  actual  purchases. 
Furthermore,  the  relationship  of  recall  to  sales  might  be  different  at 
different  points  in  the  marketing  campaign.   (4)  All  persons  received 
advertisements  starting  in  the  same  week.  There  might  have  been  uncon- 
trolled seasonal  or  other  time  effects  that  influenced  the  results. 

These  qualifications  and  criticisms  are  severe.   But  the  response 
to  them  is  strong:   These  are  the  only  data  there  are.   And  until  a  wide 
range  of  data  are  produced,  the  results  from  these  data  must  have  some 
claim  on  our  belief,  especially  as  they  have  been  so  relied  on  for  other 
purposes  in  the  past.  The  fact  that  Strong's  data,  from  an  experiment 
run  at  another  time  and  with  a  different  product,  fit  well  with  Zielske's 
data,  reduces  concern  about  qualifications  1  and  4,  however. 

Zielske  tested  out  only  two  patterns  of  advertising  spacing,  once- 
a-week  for  thirteen  weeks  and  then  no  more  advertising,  and  once-a-month 
for  13  months.   (Strong  also  tested  a  bi-weekly  schedule.)   But  there 


-9- 

are  many  other  possible  schedules,  especially  for  new  products,  including 
a  heavy  dose  of  advertising  tD  start  with  and  then  a  lower  advertising 
level.   (This  schedule  is,  in  fact,  shown  deductively  to  be  optimal  by 
Levy  and  Simon,  forthcoming.)   Some  pattern  other  than  even  spacing  of 
advertisements  might  be  better  (once  the  equilibrium  level  for  a  new 
product  is  attained) .  Further  experiments  on  a  few  of  the  most  reason- 
able-seeming patterns  would  be  necessary  to  make  this  possibility  seem 
less  likely. 

The  reader  may  wonder  how  these  findings  may  be  reconciled  with  the 
conclusion  put  forth  by  Rao  (1970)  and  Ackhof f-Emshof f  (1975a,  p.  10) 
that  pulsing  can  be  better  than  spaced  advertising.  I  cannot  answer  this 
question,  because  the  raw  data  are  shown  in  neither  discussion,  and  no 
answer  was  received  to  my  requests  for  the  data. 

SUMMARY  AND  CONCLUSIONS 

The  P-Z  experiment  pioneered  a  very  useful  method  for  understand- 
ing the  effects  of  advertising.  It  is  good  that  the  raw  data  are  finally 
available  so  that  readers  can  know  exactly  what  was  done,  and  what  the 
results  were. 

The  most  important  finding  is  that  for  a  given  advertising  budget 
and  a  given  campaign  time  period,  a  campaign  that  spreads  the  exposures 
as  widely  as  possible  over  the  period  produces  much  more  sales  impact 
(as  measured  in  recall  weeks)  than  does  a  pulsed  campaign.  And  the  ad- 
vantage of  the  spaced  campaign  is  very  large  in  percentage  terms.   This 
present  publication  is  the  first  solid  direct  evidence  for  this  proposi- 
tion. 


APPENDIX:   HOW  THE  P-Z  DATA 
NOW  APPEAR  IN  PRINT 


The  saga  of  how  this  article  and  the  P-Z  data  finally  appear  in 
print  is  rather  unusual,  and  I  believe  that  it  is  illuminating  and  in- 
structive.  I  hope  the  reader  will  understand  the  intrusion  of  such 
personal  material  into  a  professional  article. 

The  first  draft  of  this  piece  was  written  more  than  a  decade  ago, 
and  the  scotch  tape  on  the  manuscript  has  already  cracked.   In  1963, 
when  teaching  a  course  on  research  methods  in  advertising  and  communi- 
cations— my  first  semester  as  a  college  teacher — I  wished  to  use  Zielske 
(1959)  as  a  case  study  for  class  analysis.  But  when  reading  it  closely 
so  as  to  prepare  an  answer  to  one  of  my  stock  class  questions — "What 
exactly  did  the  author  do?" — I  found  that  I  could  not  answer  the  question 
myself.   So  I  wrote  to  Zielske,  and  in  October,  1963  he  sent  me  the  raw 
data  that  underlie  Table  1. 

(Each  time  I  have  taught  a  research-methods  course  since  then  I 
have  assigned  the  piece  by  Zielske  near  the  beginning  of  the  course. 
When  I  ask  "What  exactly  did  the  author  do?"  the  students  at  first  give 
confident  answers,  but  then  there  quickly  appear  contradictions  between 
them,  such  as  whether  the  total  sample  size  was  3,650  or  13,000  or 
15,000  or  a  variety  of  other  suggestions.  With  as  much  sternness  and 
drama  as  I  can  muster,  I  say  "You  mean  you  don't  know  how  many  groups 
there  were?  Better  read  it  again  for  next  week."  And  the  next  week, 
of  course,  the  students  are  still  confused,  though  the  best  ones  say 
"The  answers  cannot  be  found  in  the  article.") 

A  couple  of  years  later  I  was  revising  a  manuscript  on  the  adver- 
tising response  function  and  economies  of  scale,  and  I  wished  to 
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